An Overview of Optical Character Recognition Systems Research on Telugu Language
نویسندگان
چکیده
This paper gives an overview on the development process and ongoing research of the optical character recognition (OCR) systems for Telugu Text. The aim of this paper is to provide a starting point for the researchers entering into this field. In this paper, we present the introduction, characteristics of the Telugu language, developmental process of the OCR systems of Telugu language, research work done on Telugu scripts reorganization and scope for the future work in Telugu OCR systems. KeywordsOCR, Segmentation, feature extraction, Connected Component (CC), classification.
منابع مشابه
Optical Character Recognition (OCR) for Telugu: Database, Algorithm and Application
Telugu is a Dravidian language spoken by more than 80 million people worldwide. The optical character recognition (OCR) of the Telugu script has wide ranging applications including education, health-care, administration etc. The beautiful Telugu script however is very different from Germanic scripts like English and German. This makes the use of transfer learning of Germanic OCR solutions to Te...
متن کاملMulti-font Optical Character Recognition System for Printed Telugu Text
The Telugu OCR systems available in the market currently recognize only the specific fonts of Telugu. This paper describes the development of a multi-font OCR system for printed Telugu characters using Artificial Neural Networks. In this system classification of the characters is carried out using multi layer neural network Architecture.
متن کاملEfficient Recognition of Telugu Characters Based on Critical Points Generated Using Morphological Methods
A novel method for recognition of telugu character is proposed in this paper. The proposed method uses extraction of critical points of the characters based on grid and radial intersections analysis. The extracted critical points are classified based on the grid and radial lines, which helps in improving accuracy in recognition of characters. The algorithm is tested on various data sets and the...
متن کاملClassification and Identification of Telugu Handwritten Characters Extracted from Palm Leaves Using Decision Tree Approach
Research in character recognition is very popular for various application potentials in banks, post offices, defense organizations, reading aid for the blind, library automation, language processing and multi-media design. Even though Epigraphical work dealing with stone inscriptions have been analyzed, these have been done largely manually and also on 2D traces. A large collection of these are...
متن کاملOCR of Printed Telugu Text with High Recognition Accuracies
Telugu is one of the oldest and popular languages of India spoken by more than 66 million people especially in South India. Development of Optical Character Recognition systems for Telugu text is an area of current research. OCR of Indian scripts is much more complicated than the OCR of Roman script because of the use of huge number of combinations of characters and modifiers. Basic Symbols are...
متن کامل